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ABSTRACT 

Education researchers have long been concerned with 
finding the appropriate method for correlational analysis of 
hierarchical data. In recent years, the alternative of hierarchical 
linear modeling (HLM) has come into extensive use. HLM users 
typically center some or all student-level predictors either at the 
grand mean or at the school means. This procedure adds stability to 
the estimation process and leads to intercepts that are more readily 
interpretable. Centering also has the effect of changing the 
coefficients that are being estimated, and cannot be regarded as 
merely a technical device, when, in fact, it changes the research 
questions that are actually being asked. Some issues in centering are 
addressed theoretically and empirically. A combined equation for the 
two levels of modeling is presented, and some algebraic manipulation 
is used to show how each form of centering can be expected to modify 
the estimated coefficients. Data from the National Education 
Longitudinal Study are analyzed to investigate the effect of minority 
status on a mathematics achievement test score, comparing results 
based on centering with those based on raw data. Two tables 
illustrate the analyses. (SLD) 
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SOME OBSERVATIONS ON THE EFFECT OF CENTERING ON THE 
RESULTS OBTAINED FROM HIERARCHICAL LINEAR MODELING 



Education researchers have long been concerned with finding the 
appropriate method for correlational analysis of hierarchical data. 
With students nested within schools, for instance, it is not valid 
to enter both student- leve 1 and school-level predictors into a 
single, ordinary least squares (OLS) form of multiple regression. 
(One of the several problems with this traditional approach is that 
the number of degrees of freedom for the school-level coefficients 
is overstated, which increases the risk of Type I error when 
testing these coefficients.) 

In recent years, the alternative of hierarchical linear modeling 
(HLM) has come into extensive use. This analytic procedure models 
the predictors at their correct levels, and can be thought of as 
a two-stage process in which: 

1) First, student outcomes are modeled as a linear function of 
student predictors within each school; 

2) Next, t?ie coefficients from these school-level models are 
themselves modeled as a linear function of school-level 
predictors . 



[REF TO RAUDENBUSH & BRYK] 



In fact, however, the coefficients for the two levels of analysis 
are ef>tjmated simultaneously, using empirical Bayesian estimation. 

HLM users typically center some or all student-level predictors, 
either the grand mean or at the school means. This procedure 
is advised for two main reasons: It tends to add stability to the 
estimation process, and it leads to intercepts that are more readily 
interpre table , 

Centering also has the effect of changing the coefficients that are 
being estimated. The changes can be large when certain conditions 
obtain, and can lead researchers to very different conclusions, 
relative to those that they would have reached had they not used 
centering. This is not to say that centering is invalid or that it 
should be avoided. The point to note is that this procedure 
cannot be thought of solely as a technical device; It represents, 
rather, a change in the research questions that are being asked. 
And, of course, v;hen different questions are asked, different 
answers can be expected. 
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This paper addresses issues of centering, both theoretically and 
empirically. First , a combined equation for the two levels of 
modeling is presented, and some algebraic manipulation is used to 
show how each form of centering cjin be expected to modify the 
estimatea coefficients. Next/ data from the National Education 
Longitudinal Survey (NELS) are analyzed to investigate the effect 
of minority status on a mathematics achievement test score; the 
resulting based on centering are compared with those based on raw 
data. 

In addition to the centering issues, the paper is also concerned 
with the differences that may be expected between HLM results and 
analogous OLS results. To pursue this question, the empirical 
work was conducted using OLS, as well as HLM. 



A TWO-STAGE MODEL FOR MATHEMATICS ACHIEVEMENT 

Consider the prediction of a mathematics achievement test score from 
minority status data. We are concerned here with individual test 
scores, but wish to take account of the fact t'lat students are nested 
within schools, and that minority status can be measured at both the 
individual level and the school level. 



The use of a hierarchical model is indicated 
as follov;3: 



The model is expressed 



Student level: Lj =A '*"/^\Jl ^JL^ ^ ^Jl^ ; wVue 



YAojUi score "^or sVo^Cv^i \ \Yi Scl\ot>l 



School level 



The equations from the two levels can be combined to yield: 



We will focus our attention on the coefficients, and will not dwell 
on the error structure. It will suffice, for present purposes, to 
consider the equation for the expected value of the outcome: 
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Note that formulating the problem as a two-stage model leads to the 
inclusion of an interaction term in Equation (1); On can be thought 
of as the effect of a contextual variable (school-level proportion 
of minority students) on the expected difference between minority 
and non-minority students. 



THE EFFECT OF CENTERING THE INDIVIDUAL-LEVEL PREDICTOR 

Equation (1) uses raw data, in the sense that the student-level 
predictor is coded in zero/one form. This is the traditional 
approach to estimating the coefficients by means of ordinary least 
squares (OLS) analysis. 

In the hierarchical linear modeling (HLM) context, centering of 
student- level predictors is usually advised. Two centering trans- 
formations are in common use: centering at the grand mean, or at the 
group means (which in our case are school means) . To gain some 
insight into the first of these options, let ^ denote the grand mean 
of the minority status variable, i.e., the proportion of minority 
students across the total population of schools, and^then rewrite 
Equation (1) as: ^..^ _ V,o + ^^o C^J^ - + Y»o ^ O^J. 

V'n (-Xj,, -a) :^aj- VuaJjL 

Thus, to the extent that the estimation process is unaffected by the 
centering transformation, we can expect grand-mean centering to: 

1) riave no effect on the coefficients of either the student- 
level predictor or the interaction; 

2) Change th^ intercept from )Joo Voo+Xo"^ ' 

3) Change the coefficient of the school-level predictor from Vqi 

to Yo, ^ 

The change to the school-level effect seems counterintuitive, since 
the centering operation affects only the student-level data, and is 
merely a linear transformation of those data- Still, it appears that 
grand-mean centering might make the difference betv/een a statist- 
ically significant school effect and one that is not significant. 
(But note that this difference should occur only when a non-zero 
interaction effect is present.) 

With school-mean centering, Equation (1) can be rewritten in the 
following way: 
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It appears that, as with grand-mean centering, the coefficients for 
the student-level predictor and the interaction should not change, 
while the coeficient for the school effect should — in this case, from 

Xq\ . The situation is further complicated, however, by 
a nev7 term: the quadratic term in 0(x • Since this term is not 
part of the model, it can be expected to alter the coefficients in an 
unknown way» 

The lack of equivalence between Equations (1) and (3) is not 
surprising, since group-mean centering, unlike grand-mean centering, 
is a more radica] process than applying a linear transformation to 
existing predictors. The group-centered predictor is not a variant 
of the uncentered predictor; It is a new and different variable. 

The analogues to Equations (2) and (3) can be developed for the case 
in which the school-level variable is substantively different from 
the student-level variable, rather than being an aggregation of the 
latter variable up to the school le^^el . For grand-mean centering, 
the analogue to Equation (2) is essentially no different from that 
equation, suggesting that the effect of this type of centering can 
be predicted from the results of the uncentered model. 

For group-mean centering, this is not the case. There is no useful 
way to group terms in the resulting equation, and the effect of 
centering is even less predictable than it v;as in the simpler 
situation . 

The empirical work to be presented here is based on models that 
incorporate both types of school-level predictor: The aggregated 
version of the student- level measure of student minority status, 
and a school-level measure of faculty minority status. Note that 
the equations developed above are based on one predictor at each 
level , and cannot be expected to hold for more compl icated models . 
They may, hov;ever, provide useful approximations to the results that 
are obtained empirically. 



METHODOLOGy 

The issues discussed above were investigated empirically, using 
data from the National Education Longitudinal Survey (NELS), a 
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national longitudinal survey conducted by NCES . The survey is based 
on a two-stage design. For the base year, schools were sampled at 
the first stage, and then 8th grade students were sampled within 
schools. For the analyses to be presented here, the f:ample was 
restricted to 

[DEFINE THE SUBSET USED HERE] 

All analyses included the following variables: 

Outcome : 



A (continuous) score on a mathematics 
achievement test 

Dummy variable for Black/non-Black 
Dummy variable for Hispanic/non-Hispanic 
Continuous measure of socioeconomic status 
Continuous measure of absentee ism/ tardiness 

Proportion of minority students 
Proportion of minority faculty 
Dummy variable for urban location 
Dummy var iable for rural locat ion 
Proportion of low-SES students 

Note that, with this mode], student-level minority status is treated 
cs two variables: a dummy variable for Black, and a second dummy 
variable for Hispanic. 



Student-Level 
Predictors : 



School-Level 
Predi ctors : 



These 
way : 



variables were incorporated into HLM analyses in the following 



1) The intercepts from the student-level analyses were 
modeled as functions of all school-level variables; 

2) The slopes for the Black and Hispanic variables were 
modeled as functions of the two minority-proportion 
school-level variables ; 



3) The SES and absent/tardy variables were treated as fixed 
effects . 

Thus, in addition to the main effects at the two levels of analysis, 
four interactions are included in the model: each of the student- 
level minority variables crossed with each of the school-level 
minority-proportion variables. 

Three HLM runs will be presented: one with the two student- leve 1 
minority variables uncentered, one with these variables grand-mean- 
centered, and one in which they are group-mean centered. (The other 
two student-level variables were group-mean centered for all runs.) 

These three setups were also investigated using OLS . 

[DESCRIBE SOFTWARE] 
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RESULTS: STUDENT MINORITY STATUS PREDICTORS 

The coefficients and associated p-values are shown in Table 1, for 
all terms involving minority status and for all six of the models 
that are under consideration. 

The results of thi^ HLM analyses and the OLS analyses are essentially 
the same, and we will disr;uss these results in terms of the numerical 
values from the HLM analyses. (Note, though, that the similarity of 
the two methods, with regard to the p-values as well as the 
coefficients, is itself of some interest.) 

Moving from no centering to grand-mean centering has virtually no 
effect on the coefficients. This is consonant with Equation (2), 
because the interactions in the uncentered model are close to zero 
and are not significant. However, the small change observed in the 
school-level coefficient (from 0.034 to 0.050) is positive, and this 
is consistent with the fact that the interaction coefficients (and 
the grand-mean proportion of minority students) are positive. 

Going from no centering to group-mean centering increases the 
school-level effect from a non-significant value (gamma = 0.034, 
p = 0.783) to a significantly negative value (gamma = -0,187, 
p = 0.040). This change follows from Equation (3), since the 
(highly significant) student-level effect is incorporated into the 
school-leve] effect when group-mean centering is used. 



RESULTS: FACULTY MINORITY STATUS PREDICTOR 

The models that have been tested all contain a main effect for the 
school-level variable measuring the proportion of minority faculty, 
and for the interactions involving this variable and the tv;o 
student-level minority status variables. The results of the 
analyses, for these three variables, are shown in Table 2. 

The effect of group-mean centering on the coefficient of the school- 
level variable is again apparent, although this effect is much more 
pronounced for the HLM analysis than for the OLS analysis. It is 
important to note that Equation (3) holds only for the situation 
in which the school-level variable is the aggregated version of the 
student- level variable. When the two variables are substantively 
different, as is the case here, the effect of centering cannot be 
predicted in any simple way. 

With the student-level minority status variables uncenterd, the 
HLM model shows a negative relationship between proportion of 
minority teachers and mathematics score (gamma = -0.176, p = 0.024). 
With group centering, the relationship is attenuated and 3s no longer 
significant (gamma = -0.053, p = 0.222). The directionality of the 
change seems surprising. The student-leve] effect, rather than 
augmenting the school-level effect, tends to cancel it. 
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Table 1. Results of HLM and OLS analysis of the effect of 
minority status on mathematics achievement score 



Model 




HLM 
Coef f 


D Vn 


OLS 
Coef f 


Pr ob 


NO CENTERING 


Stdnt-lev (B) 


-3 


612 


[ .000) 


-3 


706 


( . 000) 




Stdnt-lev (H) 


— 1 


V! 1 Q 

4 1 o 


[ .037) 




D4 o 


( .016) 




Sch-lev 


u 


n Q A 


( .783) 






( .842) 






0 


026 


t Q 1 ^ 
, .^10; 


0 


133 


/ R 9 C \ 




Interact (H) 


0 


027 


( . 899) 


0 


129 


( .468 ) 


GRAND-MEAN 


Stdnt-lev ' (E) 


-3 


611 


( . 000) 


-3 


706 


( . 000) 


CENTERING 


Stdnt-lev (H) 


-1 


418 


( . 037 ) 


-1 


543 


( . 016) 




Sch-lev 


0 


050 


( .639) 


0 


058 


( .492) 




Interact (B) 


0 


025 


( .915) 


0 


133 


( . 526) 




Interact (H) 


0 


027 


( . 890) 


0 


129 


( .468) 


GROUP-MEAN 


Stdnt-lev (B) 


-3 


.198 


( . 000) 


-3 


097 


( . 000) 


CENTERING 


Stdnt-lev (H) 


-2 


. 199 


( .009) 


-2 


.129 


{ .016) 




Sch-lev 


-0 


. 187 


( .040) 


-0 


. 163 


( . 013 ) 




Interact (B) 


-0 


. 209 


( .488) 


-0 


. 221 


( .481 ) 




Interact (H) 


0 


. 101 


( .677) 


0 


. 112 


( . 661 ) 



Stdnt-lev (B) = 1 for Black non-Hispanic, = 0 otherwise 
Stdnt-lev (H) = 1 for Hispanic, = 0 otherwise 
Sch-lev = proportion minority students in school 
Interact (B) = interaction between Stdnt-lev (B) and Sch-lev 
, Interact (H) = interaction between Stdnt-lev (H) and Sch-lev 
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Table 2. Results of HLM and OLS analysis of the effect of 

the pi'oportion of minority faculty on mathematics 
achievement score 



Model 



NO CENTERING 



Sch-lev 

Interact (B) 
Interact (H) 



HLM 
Coeff 



Prob 



-0.176 (.024) 



0. 227 
0. 159 



( .017) 
( .078) 



OLS 
Coeff 



-0.141 

0 .171 
0.121 



Prob 



( .049) 

( .051 ) 
( . 150) 



GRANH-MEAN 
CENTERING 



Sch-lev 

Interact (B) 
Interact (H) 



-0.063 ( . 179) 



0. 227 
0. 159 



( .017) 
( .077) 



-0.053 (.132) 



0.171 
0.121 



( .051) 
( . 150) 



GROUP-MEAN 
CENTERING 



Sch-lev 

Interact (B) 
Interact (H) 



•0.053 (.222) 



0.316 
0.171 



( . 007 ) 
( . 106) 



•0.052 (.073) 



0.321 
0.156 



( .008) 
( .159) 



Sch-lev = proportion minority faculty in school 

Stdiit-lev (B) = 1 for Black non-Hispanic, = 0 otherwise 

Stdnt-lev (H) = 1 for Hispanic, = 0 otherwise 

Inter&ct (B) = interac+ion between Stdnt-lev (B) and Sch-lev 

Interact (H) - interaction between Stdnt-lev (H) and Sch-lev 
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Grand-mean centering also affects the coefficient of the school-level 
measure, reducing it to -0.063, which is not significantly different 
Trom zero (p - 0.179). This change is predictable, since Equation 
(2) is valid, regardless of whether or not the school-level variable 
is an aggregated version of the student-level variable. The change 
is attributable to the significantly positive interaction effects, 
which partially cancel the original negative effect of the school- 
level variable. 



CONCLUSIONS 

As suggested by Equations (2) and (3), centering can al^^er the 
conclusions that the researcher draws from the data analysis. For 
mocJels in which the same var iable--student minority status, in our 
case~-is being considered at both levels of analysis, the school- 
-level component can assume signif ilcance or lose significance as 
a result of centering. 

For the particular datasel and models considered here, a researcher 
using uncentered or grand-mean centered student minority status 
would conclude that jt is only the individual student's minority 
status that affects that student's mathematics achievement. But 
a second researcher, choosing to group-mean center the same variable, 
would conclude that the average m.inority status of the student's 
school also has a significant effect. Neither conclusion is 
incorrect. Each one, however, is tied to the centering upon which 
it is based. 

Using Equation (2), the researchers can, to some extent at ]east, 
predict each other's results. But this is seldom done. The normal 
procedure is to seDect one or another centering option for reasons 
that are either technical or, if substantive, only defensible in 
vague terms, and then to interpret the results of the analysis 
without regard to the centering. 

VJith regard to the significance of average faculty minority status, 
all researchers would agree that this variable interacts with the 
individuaJ student's minority status to influence mathematics 
achievement. A researcher using uncentered data would also conclude 
that that the faculty variable itself affects achievement, while 
researchers using either of the centering options would not observe 
such an effect. Again, differences in centering lead to different 
findings; in this case, to findings that are more difficult to 
reconciJe on an algebraic basis. 

The above remarks are based on HLM analysis, but apply to OLS 
analysis as v;ell; The effects of centering are not limited to HLM, 
but apply to more traditional forms of regression as well. Finally, 
it is interesting to note that no strong or consistent difference 
was found between KLM and OLS with regard to significance level of 
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school predictors. It was expected, since OLS overstates the 
degrees of freedom for these predictors, that this procedure would 
be too lenient in finding significant relationships. However, among 
the six school-level tests shown in Tables 1 and 2, it is only the 
group-centered model shown in Table 2 for which this phenomenon is 
clearly present. 
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